Weakly-supervised pre-training for 3D human pose estimation via perspective knowledge
نویسندگان
چکیده
Modern deep learning-based 3D pose estimation approaches require plenty of annotations. However, existing datasets lack diversity, which limits the performance current methods and their generalization ability. Although utilize 2D annotations to help estimation, they mainly focus on extracting structural constraints from poses, ignoring information hidden in images. In this paper, we propose a novel method extract weak directly images without supervision. Firstly, perspective prior knowledge generate relative depth human joints. Then, collect dataset (MCPC) labels. Based MCPC, weakly-supervised pre-training (WSP) strategy distinguish relationship between two points an image. WSP enables learning keypoints lots in-the-wild images, is more capable predicting ability for estimation. After fine-tuning model datasets, achieves state-of-the-art results widely-used benchmarks.
منابع مشابه
Hand pose estimation through semi-supervised and weakly-supervised learning
We propose a method for hand pose estimation based on a deep regressor trained on two different kinds of input. Raw depth data is fused with an intermediate representation in the form of a segmentation of the hand into parts. This intermediate representation contains important topological information and provides useful cues for reasoning about joint locations. The mapping from raw depth to seg...
متن کاملConditional Models for 3d Human Pose Estimation
OF THE DISSERTATION Conditional Models for 3D Human Pose Estimation by ATUL KANAUJIA Dissertation Director: Dimitris Metaxas Human 3d pose estimation from monocular sequence is a challenging problem, owing to highly articulated structure of human body, varied anthropometry, self occlusion, depth ambiguities and large variability in the appearance and background in which humans may appear. Conve...
متن کاملSelf Adversarial Training for Human Pose Estimation
This paper presents a deep learning based approach to the problem of human pose estimation. We employ generative adversarial networks as our learning paradigm in which we set up two stacked hourglass networks with the same architecture, one as the generator and the other as the discriminator. The generator is used as a human pose estimator after the training is done. The discriminator distingui...
متن کاملWeakly Supervised PLDA Training
PLDA is a popular normalization approach for the i-vector model, and it has delivered state-of-the-art performance in speaker verification. However, PLDA training requires a large amount of labelled development data, which is highly expensive in most cases. We present a cheap PLDA training approach, which assumes that speakers in the same session can be easily separated, and speakers in differe...
متن کامل3D Human Body Pose Estimation by Superquadrics
Abstract: This paper presents a method for 3D Human Body pose estimation by using a multi-camera system. The pose is estimated by RANSAC-object search with a robust least square fitting of 3D points to SuperQuadric (SQ) models of the searched object. The solution is verified by evaluating the matching score between the SQ object model and 3D real data captured by a multi-camera system and segme...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2023
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2023.109497